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Express Mail No. EL550124001US 
METHOD OF COMPILING SCHEMA MAPPING 



TECHNICAL FIELD 

The present invention relates to mapping source schemas with target schemas. 
More particularly, the invention relates to a method of compiling a mapping between a 
source schema with a target schema in computer systems. 

BACKGROUND OF THE INVENTION 

The internet is enabling businesses to develop new ways to facilitate efficient 
and automated interactions between their own internal line of business, productivity 
and knowledge management applications, the applications used by their customers and 
trading partners, and services provided by their commercial and corporate providers. 
The challenges associated with enabling such efficient and automated interactions 
between apphcations across business boundaries in a cost effective manner are similar 
to those associated with enabling them within an enterprise or departmental boundary. 
Building business-to-business (B2B) e-commerce systems presents many challenges to 
the system architect. Often, each company involved stores their data and documents in 
formats that are different from the formats used by the other participating companies. 
These companies need a way to integrate what tend to be vastly different systems. In 
addition, Litemet security issues are involved in business to business data interchange. 
Thus, several challenges are faced by businesses, governments, and other entities 
wishing to interact via electronic documents and other forms of electronic commerce. 

The challenges to interaction across business boimdaries include lack of a 
sufficiently flexible and rich universal language to specify, package, publish, and 
exchange both structured and unstructured information across apphcation or business 
boundaries, lack of a flexible and rich universal language to specify, package, publish, 
and execute transformation rules to convert information from one format to the other as 
application and business boimdaries are crossed, and lack of middleware-neutral 
application level communication protocols to enable automated interactions across 
application or business boundaries. The extensible Markup Language (XML), and 
XML-based schema languages enable businesses and other users to describe and 
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exchange structured information between collaborating applications or business 
partners in a platform and middleware neutral manner. 

Domain specific standards bodies and industry initiatives have started to adopt 
XML and XML-based schema languages to specify both their vocabularies and content 
models. These schemas are becoming widely published and implemented to facilitate 
communication between both applications and businesses. Wide support of XML has 
also resulted in independent solution providers developing solutions that enable the 
exchange of XML-based information with other external 3""^ party or custom developed 
applications. Several solution or middleware/platform specific approaches have been 
taken to address the lack of middleware-neutral apphcation level communication 
protocols. 

However, no single proprietary solution or middleware platform meets all the 
needs of a complex deployment environment. In addition, the cost and complexity of 
integrating together ERP systems, inventory management systems, or sales tracking 
systems within a single organization strains the capabilities of most large and small 
companies ahke. In the near future, electronic commerce is going to require massive 
amounts of application integration across multiple organizations, as trading partners 
turn to the Internet to automate supply chains, forecasting systems, and government 
services, and new types of business interchanges. 

Although XML provides a standardized format for document definition, current 
development tools require system developers to create code for mapping individual 
source document schemas with target schemas in order to facilitate the interchange of 
information between business organizations or entities. In a typical example, two 
companies may create schemas for their documents in XML. Where these companies 
want to exchange information via internal documents, an XML schema is created for 
documents at each company, and a system developer generates code mapping 
information transfer between the source and target document schemas. Such mapping 
code may be developed in a language such as extensible Sheetstyle Language (XSL). 
A developer may thus create an XSL style sheet representation of a mapping between 
source and target schemas. 
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XSL at its most basic provides a capability similar to a "mail merge." An XSL 
style sheet contains a template of the desired target structure, and identifies data in the 
source document to insert into this template. This model for merging data and 
templates is referred to as the template-driven model and works well on regular and 
repetitive data. XSL also provides capabilities for handling highly irregular and 
recursive data such as is typical in documents. Template fragments are defined, and the 
XSL processor combines the results of these fragments into a final result tree based on 
the shape of the source data. Each template fragment declares the type and context of 
source nodes it is appropriate for, allowing the XSL processor to match source nodes 
with template fragments. This model is known as the data-driven model since the 
shape of the data drives the shape of the final output. An XSL transformation process 
is thus independent of any particular output grammar and can be used for translating 
XML data from one schema to another. 

Heretofore, system developers have provided code defining the 
interrelationships between XML business documents, based on the desired information 
to be exchanged via electronic commerce applications. However, for each source and 
target document set, code must be written mapping the relationships therebetween. 
Whether the code representing such a mapping is written in XSL or another language, 
writing this code is difficult and typically beyond the capabihties of ordinary business 
personnel, hnproved tools are needed for business personnel to define document 
mappings for business to business applications, as well as to generate mapping code, 
which do not require extensive knowledge of software programming. 

SUMMARY OF THE INVENTION 

A method and system are provided for compiling a mapping between two 
documents or schemas, which generates a coded mapping. The invention takes a user- 
defined mapping between a source schema representing the definition of a source 
document, and a target schema representing a target document, and compiles the 
mapping into code for execution in a computer system. The represented documents 
may be XML documents, and the generated code may be, for example, extensible 
sheetstyle language (XSL). The mapping may be defined, for example, graphically 



3 



MS147163.1 

using a graphical user interface, which the compiler system and method of the 
invention uses to generate code, hi this way, an unsophisticated user may define 
relationships between business documents used across business and/or application 
boundaries, and generate code capable of handling the information transfers 
5 therebetween, without having extensive knowledge of computer programming 
languages. 

In accordance with one aspect of the invention, there is provided a method of 
generating code representing a mapping between a source schema and a target schema. 
The mapping includes data transformations from the source schema to the target 
10 schema. The source schema includes a source tree having one or more source nodes 
and the target schema includes a target tree having one or more target nodes. The 
method comprises determining source node dependencies for the target node by tracing 
Q from the target node through the mapping to the source schema, matching hierarchy by 

generating a hierarchy match list for the target node, and generating code according to 
05 the hierarchy match Ust. By the method of the invention, code may be generated from a 

mapping. For example, a graphical representation of data transformations fi-om a 
source schema to a target schema may be compiled into an XSL style sheet 
,1 representation of the mapping. The method may further comprise initializing node 

5 dependencies memory, includmg allocating memory for a compiler node, associating 

^10 the compiler node with the target node, allocating memory for compiler variable 

p classes, and associating compiler variable classes with functoids. 

According to another aspect of the invention, a method is provided for 
compihng a mapping between a source schema having source nodes associated 
therewith, and a target schema havmg target nodes associated therewith. The method 
25 comprises determining source node dependencies for at least one target node by tracing 
from at least one target node through the mapping to the source schema, matching 
hierarchy by generating a hierarchy match Ust for the at least one target node, and 
generating code according to the hierarchy match list. 

According to yet another aspect of the invention, a system is provided for 
30 generating code representing a mapping between a source schema and a target schema, 

wherein the mapping comprises data transformations from the source schema to the 
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target schema, the source schema comprises a source tree having a source node and the 
target schema comprises a target tree having a target node. The system comprises 
means for determining source node dependencies for the target node by tracing from 
the target node through the mapping to the source schema, means for matching 
5 hierarchy by generating a hierarchy match hst for the target node, and means for 

generating code according to the hierarchy match Ust. 

According to yet another aspect of the invention, a computer-readable medium 
is provided, having computer-executable instructions for generating code representing a 
mapping between a source schema and a target schema, the mapping comprising data 
10 transformations from the source schema to the target schema. The source schema 

comprises a soxarce tree having a source node and the target schema comprises a target 
tree having a target node. The computer-readable medium further comprises computer- 
P executable instructions for determining source node dependencies for the target node 

:^ by tracing from the target node through the mapping to the source schema, 

05 matching hierarchy by generating a hierarchy match list for the target node, and 

in generating code according to the hierarchy match list. 

%^ To the accomplishment of the foregoing and related ends, the invention 

comprises the features hereinafter fully described and particularly pointed out in the 
ff^ claims. The following description and the annexed drawings set forth in detail certain 

^10 illustrative embodiments of the invention. These embodiments are indicative, however 

O of but a few of the various ways in which the principles of the invention may be 

employed. Other objects, advantages and novel features of the invention will become 
apparent from the following detailed description of the invention when considered in 
conjunction with the drawings. 

25 

BRIEF DESCRIPTION OF THE DRAWINGS 

The foregoing and other aspects of the invention will become apparent from the 
following detailed description of the invention and the attached drawings in which: 

Fig. 1 is a flow diagram illustrating a method of generating code representing a 
30 mapping between a source schema and a target schema in accordance with the present 

invention; 
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Fig. 2 is a schematic diagram illustrating an XSL engine translating a source 
XML document into a target XML document according to an XSLT map in accordance 
with the invention; 

Fig. 3 is a flow diagram illustrating an exemplary first pass of an exemplary 
5 compiler method in accordance with the invention; 

Fig. 4A is a flow diagram illustrating an exemplary second pass of an 
exemplary compiler method in accordance with the invention; 

Fig. 4B is a flow diagram further illustrating the second pass of the method of 
Fig. 4A; 

10 Fig. 4C is a flow diagram further illustrating the second pass of the method of 

Figs. 4A and 4B; 

Fig. 4D is a flow diagram further illustrating the second pass of the method of 
n Figs. 4A , 4B, and 4C; 

;^ Fig. 4E is a flow diagram further illustrating the second pass of the method of 

05 Figs. 4A -4D; 

m Fig. 4F is a flow diagram further illustrating the second pass of the method of 

[J Figs. 4A -4E; 

r: Fig. 4G is a flow diagram further illustrating the second pass of the method of 

m Figs. 4A -4F; 

^ Fig. 4H is a flow diagram further illustrating the second pass of the method of 

3 Figs. 4A -4G; 

'""^ Fig. 5 A is a flow diagram illustrating an exemplary third pass of an exemplary 

compiler method in accordance with the invention; 

Fig. 5B is a flow diagram further illustrating the third pass of the method of Fig. 

25 5A; 

Fig. 5C is a flow diagram further illustrating the third pass of the method of 
Figs. 5A-5B; 

Fig. 5DC is a flow diagram further illustrating the third pass of the method of 
Figs. 5A-5C; 

30 Fig. 5E is a flow diagram further illustrating the third pass of the method of 

Figs. 5A-5D; 
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Fig. 6A is a schematic diagram illustrating an exemplary hierarchy matching 
where a compiler hnk option is flattening in accordance with an aspect of the invention; 

Fig. 6B is a schematic diagram illustrating another hierarchy matching where a 
compiler link option is flattening in accordance with an aspect of the invention; 

Fig. 7A is a schematic diagram illustrating an exemplary hierarchy matching 
where a compiler link option is top-down in accordance with an aspect of the invention; 

Fig. 7B is a schematic diagram illustrating another hierarchy matching where a 
compiler link option is top-down in accordance with an aspect of the invention; 

Fig. 8 A is a schematic diagram illustrating an exemplary hierarchy matching 
where a compiler link option is bottom-up in accordance with an aspect of the 
invention; 

Fig. 8B is a schematic diagram illustrating another hierarchy matching where a 
compiler link option is bottom-up in accordance with an aspect of the invention; 

Fig, 9A is a flow diagram illustrating an exemplary fourth pass of an exemplary 
compiler method in accordance with the invention; 

Fig. 9B is a flow diagram further illustrating the fourth pass of the method of 
Fig. 9A; 

Fig. 9C is a flow diagram further illustrating the fourth pass of the method of 
Figs. 9A-9B; 

Fig. 9D is a flow diagram further illustrating the fourth pass of the method of 
Figs. 9A-9C; 

Fig. 10 is a flow diagram illustrating an exemplary fifth pass of an exemplary 
compiler method in accordance with the invention; 

Fig. 11 is a schematic diagram illustrating an exemplary mapping in accordance 
with the invention; 

Fig. 12 is a schematic diagram illustrating another mapping in accordance with 
the invention; 

Fig. 13 is a schematic diagram illustrating another mapping in accordance with 
the invention; 

Fig. 14 is a schematic diagram illustrating another mapping in accordance with 
the invention; and 
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Fig. 15 is a schematic diagram illustrating an exemplary operating environment 
in which the invention may be implemented. 

DETAILED DESCRIPTION OF THE INVENTION 

5 The following is a detailed description of the present invention made in 

conjunction with the attached figures, wherein Hke reference numerals will refer to like 
elements throughout. According to one aspect of the invention, a method and system 
are provided for compihng a mapping between two documents or schemas, which 
generates a coded mapping which may be executed in a computer system. The 
10 invention further contemplates a computer-readable storage medium having computer- 
executable instructions for compiling a document or schema mapping. The invention 
will be described hereinafter with respect to an exemplary implementation in which 
Q XML documents and extensible sheetstyle language (XSL) are used to illustrated the 

various aspects of the invention. However, it will be recognized that the invention 
Q5 comprises implementations for generating code in other languages as well. In addition, 

ij^ the invention contemplates that the source and target may be any type of object, for 

example, schemas, spreadsheets, databases, documents, and the like. The invention 
thus advantageously allows an unsophisticated to define relationships between business 
^ docimients used across business and/or application boundaries, and to generate code 

fiO capable of handling the information transfers therebetween, without having extensive 

p knowledge of computer programming languages. 

- Referring now to the drawings, Fig. 1 illustrates illustrating a multi-pass method 

2 of generating code representing a mapping between a source schema and a target 
schema in accordance with the invention. The compilation begins at step 4 after which 

25 node dependencies memory is initiaUzed in pass 1 of the compilation at step 6. 

Thereafter, path 2 of the compilation is performed at step 8 wherein the correct source 
loop path is picked, hi addition multiple source loop paths are detected at step 8. At 
step 10, the third pass of the multi-pass compilation 2 proceeds with 
hierarchy matching, after which code generation is performed in pass 4 at step 12. 

30 Finally, node dependencies memory is freed at step 14 before the compilation 2 is 

completed at step 16. 
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The multi-pass compilation method 2 of Fig. 1 maybe advantageously used, for 
example, to generate an XSL style sheet for a visual representation of a map created by 
a user. In this regard, the user may generate a visual representation or mapping using a 
graphical mapping definition tool (not shown). The exemplary compilation 2, 
5 moreover, uses a multi-pass algorithm to achieve this task. Each pass processes some 
information for nodes in a target schema tree (not shown) and collects data that is used 
for subsequent passes. As will be discussed in greater detail hereinafter, a source 
document schema and a target document schema may comprise a tree structure, 
wherein nodes in the trees may be organized as records and fields. A record may be, 
1 0 for example, an XDR (XML data reduced) schema element that can contain other 

records and fields. A field is an XDR schema element or attribute that caimot contain 
other nodes (e.g., a leaf node). 
□ Referring now to Fig. 2, an application of the invention is illustrated 

schematically, wherein a system 20 includes a source XML document 22, a target XML 
C|5 document 24, with an XSL engine 26 therebetween. The XSL engine 26 may 

in comprise, for example, a computer system, which provides data transformations 

JJJ between the source XML document 22 and the target XML document 24 in accordance 

with an XSLT map 28. In this regard, the method 2 of Fig. 1 may be used to compile a 
^ mapping (not shown) into the computer executable instructions or codes, for example, 

(UO XSLT code (e.g., map 28), which is run by the engine 26 in performing the data 
Q transformation. 

- Figs. 3-5 illustrate a multi-pass compiler algorithm or method for generating an 

XSL style sheet for visual representation of a mapping created by a user. Each pass in 
the illustrated method includes a traversal algorithm defined by a traversal class. All 

25 passes except for code generation employ depth-first tree traversal of the target schema 

tree. Depth-first tree traversal includes traversing downward on the target tree, 
executing a pre-execute parent fimction for all parent nodes through a path all the way 
to a leaf node. When a leaf node has been reached, an execute-leaf fimction is called, 
after which the path is traversed upward back through the tree. On the return path 

30 upwards, a post-execute parent fimction is called for the tree nodes traversed. 
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The code generation pass uses field-attribute traversal wherein a compiler 
traverses all child field attributes before field elements and records. Code is written for 
all field attribute nodes under a parent node prior to generating code for record nodes 
and field elements. Thus, unlike pure depth-first traversal, in the field-attribute 
traversal, all fields nodes under a particular record node are processed before the record 
node itself is processed^ in the code generation pass. 

Each pass is encapsulated in a separate class. These classes inherit fi-om a 
single base class that is used by the traversal classes to traverse the destination or target 
schema tree. The base class defines three pure virtual fimctions. These include a pre- 
execute parent fianction, a post-execute parent function, and an execute leaf fiinction. 
Each of the derived classes representing the passes implement these three pure virtual 
fimctions. 

As used hereinafter the term tree node may include a C++ class that 
encapsulates a graphical user interface representation of a tree node. A compiler node 
may be a C++ class that is used for the XSL compiler related information. Compiler 
nodes are associated with the tree nodes in the target schema tree, hi addition, the term 
fimctoid encapsulates some fimctionality used for the visual mapping process. A 
flmctoid has zero or more inputs and a single output. A fimctoid manipulates the inputs 
(depending on the fianctoid) and provides the result in its output. Using a graphical 
mapping definition tool (not shown), a user may map data fi-om a source schema tree to 
a target schema tree, incorporating one or more fimctoids as required to implement the 
desired data transformation. 

PASS 1 - INITIALIZE MEMORY 

Referring to Fig. 3, pass 1 (30) of a multi-pass compiler algorithm or method 
begins at step 32 wherein node dependencies memory is initiahzed, after which 
compiler node memory is allocated at step 34. Thus, a portion of compiler memory 
may be allocated for each node in the target tree. Thereafter, compiler nodes are 
associated with target tree nodes at step 36, and memory is allocated for compiler 
variable classes at step 38, Compiler variable classes are then associated with fimctoids 
at step 40 after which pass 1 (30) completes at step 42. 

PASS 2 - SOURCE LOOP PATH SELECTION 
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Referring now to Fig. 4A, pass 2 (50) begins at step 52 wherein source node 
dependencies for target tree nodes are determined using depth-first tree traversal of the 
target tree schema. Target dependency lists and source dependency lists (not shown) 
are created for the nodes in the target tree at steps 54 and 56, respectively. For target 
tree nodes, the compiler then traces back to the source tree through all possible paths 
recursively, adding source tree nodes to the corresponding source dependency lists at 
step 58. A "GENERATESCODE" flag is then set for the compiler node associated 
with the target tree node, and its parent compiler node if there is a path from the target 
tree node to the source tree at step 60. If set, this flag may indicate to the compiler in 
later passes that the node needs to have code generated for it. It will be appreciated that 
if a child node generates code, so will a parent node associated therewith, and hence the 
"GENERATESCODE" flag is set therefor as well Thereafter at step 62, the target tree 
node is added to the target dependency list of its parent record node if the current target 
tree node is a field node. The target dependency list therefore indicates which child 
nodes of a particular target parent node will generate code. 

Referring also to Fig. 4B, compiler pass 2 (50) continues at step 64 wherein the 
target dependency lists are consolidated for the target tree record nodes. At step 66, 
child record nodes are added to the target dependency hsts of associated target tree 
record nodes if the target tree record node has a child field node with 
"GENERATESCODE" flag set and the target tree record node has a child record node 
with "GENERATESCODE" flag set. Pass 2 (50) then continues at step 68 wherein the 
source dependency lists for the target tree record nodes are consoUdated. This includes 
adding source dependencies of each node in the target dependency list to the current 
node source dependency list for target tree record nodes at step 70. 

Referring now to Fig. 4C, pass 2 (50) fiirther includes selecting source loop 
paths for target tree record nodes according to source dependency lists at step 72. The 
purpose of this selection is to determine which of the source nodes which create a 
dependency on a target node, defines the source path for which code will later be 
generated. This comprises iterating through the source nodes in the consolidated 
source dependency list for target tree record nodes, and finding the highest level loop 
point for which maxoccurs = * for the source node at step 74. The attribute maxoccurs 
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may be used to indicate the maximum number of times that a particular source node 
can appear in a document. The values for maxoccurs are "1" and where a value of 
1 indicates that only one occurrence is possible and a value of * indicates that more 
than one occurrence of a node is possible. In similar fashion, a minoccurs attribute may 
be provided for each node, wherein a value of "0" means the node might not appear in a 
document, and a value of "1" indicates that the node must occur in the document. 

The source loop path node may be used in a subsequent code generation pass 
(e.g., pass 4 (250) as illustrated in Figs. 9A-9D and described in greater detail infra) as 
the point aroimd which the generated code will iterate or loop. As an example, where 
the generated code is XSL, the source loop path node may be used in generating <xsl : 
for-each> code. 

As used herein, the term highest level refers to the hierarchy of the tree structure 
of the source and/or target schema, wherein a node level refers to the number of 
hierarchy levels separating a particular node from its root node. For example, a node in 
a tree directly connected with its root node is at level one, wherein a node having 
connection to a root node through three intervening nodes is at level 4, etc. 

A loop point or source loop path node is a source tree node for which 
maxoccurs = If no loop point is found at step 76, the method continues as illustrated 
and described hereinafter with respect to Fig. 4G. If a loop point is found at step 76, 
decision step 78 determines whether the current loop point is at a higher level than a 
previous loop point. In this regard, in the iteration through the source nodes listed in 
the consolidated source dependency list for a target tree node, the compiler may 
identify a loop point, and later identify another. Thus, decision step 78 arbitrates 
between a previous and current loop point thus identified, to determine which one is the 
appropriate loop point to be stored in preparation for a subsequent code generation pass 
(e.g., pass 4 (250) of Figs. 9A-9D). For example, the desired loop point is the one that 
is at the highest level. 

If the current loop point is at a higher level than a previous loop point at step 78, 
the method proceeds as illustrated and described with respect to Fig. 4D. If not, 
decision step 80 determines whether the current loop point is at a lower level than the 
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previously found loop point. If this is trae, the method proceeds via Fig. 4E. If not, 
(the current loop point and the previous loop point are at the same level), the compiler 
method proceeds as shown and described with respect to Fig. 4F. 

Referring now to Fig, 4D, pass 2 (50) precedes at step 82 for the case where the 
5 current loop point is at a higher level than the previous loop point. Decision step 84 

determines whether the paths to the previous and current loop points converge. If not, 
the compiler method determines whether multi-loop paths exist at step 86, after which a 
compiler error may be generated at step 88. However, if the paths to the previous and 
current loop points do converge, the current node is stored in the source dependency list 
10 as the source loop path node at step 90. Pass 2 (50) then continues at step 92 wherein 

loop path selection is performed for the remaining target tree record nodes. 

Referring to Fig. 4E, pass 2 (50) is illustrated for the case where current loop 
r1 point is a lower level than the previous loop point beginning at step 100. Decision step 

;2 102 determines whether paths to the previous and current loop points converge. If not, 

Q 5 multiple loop paths exist and a compiler error may be generated via steps 104 and 1 06, 

respectively. If the paths converge, the previous node is stored in the source 
dependency list as the source loop path node for the current target node at step 108 after 
which pass 2 (50) continues at step 1 10 to perform source loop path selection for the 
5p remaining target tree record nodes. 

^?tO Referring also to Fig. 4F, pass 2 (50) is illustrated for the case where the current 

Q loop point and the previous loop point are at the same level beginning at step 112. If 

this is the case, decision step 114 determines whether the current and previous loop 
point nodes are the same node. If not, multiple loop paths exist and a compiler error 
may be generated at steps 116 and 118, respectively. If a current and previous loop 
25 point nodes are indeed the same, this node is stored in the source dependency list for 
the current target node as the source loop path node at step 120, Thereafter pass 2 (50) 
continues at step 122 for any remaining target tree record nodes. 

As discussed above, if no loop point node is found at step 76 of Fig. 4C for a 
particular target node, pass 2 (50) continues as illustrated in Fig. 4G at step 124. The 
30 current node is then stored in the source dependency list as the source loop path node at 
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step 126 before the source loop path selection of pass 2 (50) is continued at step 128 for 
remaining target tree record nodes. 

Referring to Fig. 4R, pass 2 (50) continues at step 130 after which decision step 
132 determines whether the source loop path node has been determined. If not, pass 2 
5 (50) source loop path selection is continued for remaining target tree record nodes at 

step 134. However, once the source loop path node has been determined decision step 
136 determines if the source loop path node is a field node. If not, the compiler stores 
the source loop path node in the compiler node associated with the current target node 
at step 138. If, however, the source loop path node is a field node, the parent record 
10 node of the source loop path node is stored as the source loop path node in the compiler 
node at step 140 prior to continuation of pass 2 (50) at step 134. Pass 2 (50) continues 
in this fashion to determine source node dependencies for the target tree nodes using 
Q depth-first tree traversal of the target tree schema in the manner described in Figs. 4A- 

2 4H. 

3 5 PASS 3 - HIERARCHY MATCHING 

ill Having determined a source loop point (and hence a source loop path) for the 

target tree nodes in pass 2 (50), Figs. 5A-5E illustrate an exemplary pass 3 (150) which 
may be used for hierarchy matching using depth- first tree traversal of the target tree 
-:|] schema. Pass 3 (150) may be employed as part of a multi-step or multi-pass method of 

'30 generating code representing a mapping between schemas, which may represent XML 
0 documents. The pass 3 hierarchy matching 150 begins at step 152, where after decision 

step 154 determines whether the current target node has a source loop path node. If 
not, no hierarchy matching is done aad the next target node is processed at step 156 and 
154. If the current target node has a source loop path node at decision step 154, a 
25 hierarchy match list is created for the target node at step 158 after which the compiler 

determines a consolidated compiler link option using links on target node dependency 
at step 160. 

A consoUdated compiler link option is a property associated with a mapping 
link to a target tree node, which may be defined by a user in creating the mapping. In 
30 the exemplary compiler method illustrated herein, the possible compiler link options 

may include, for example, flattening, top-down, and/or bottom-up. A compiler link 
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option may be associated with a link from a mapping to a particular target tree node via 
the mapping definition created by a user. Decision step 162 checks for conflicting 
compiler links. If such compiler link conflicts exist, step 164 may generate a compiler 
error. 

5 Referring now to Fig. 5B, hierarchy matching of pass 3 (150) continues at step 

166 after which decision step 168 determines whether the compiler hnk option is 
flattening, top-down, or bottom-up. Referring also to Fig. 5C, if the compiler link 
option is flattening, pass 3 (150) continues at step 170. Thereafter, the compiler 
traverses up the source tree from the source loop path node, adding each source node 
1 0 encountered to the hierarchy match Hst for the current target tree node at step 1 72, 

before processing the next target tree node beginning with step 156 of Fig. 5 A. 

Referring also to Fig. 5D, if the compiler link option is top-down, hierarchy 
matching pass 3(150) continues at step 174 with decision step 176 determining 
whether the target node level is greater than the source loop path node level. If so, the 

05 compiler traverses up the target tree until the levels match at step 178, and the source 

loop path node is added to the hierarchy match hst of each target node encountered in 

jfj traversing up the target tree at step 1 80. Thereafter, the compiler traverses up the 

^} source and target trees, adding source tree nodes to the hierarchy match lists of the 

target tree nodes level by level at step 182. 

^|0 If the target node level is not greater than the source loop path node level at 

i;j decision step 176, the compiler traverses up the source tree until the levels match at 

step 1 84 adding the source tree nodes encountered to the hierarchy match list of the 
target tree node at step 186. Thereafter at step 182, the compiler traverses up the source 
and target trees, adding source tree nodes to the hierarchy match lists of target tree 

25 nodes level by level. 

Referring now to Fig. 5E, the pass 3 hierarchy matching 150 continues at step 
188 for the case where the compiler hnk option is bottom-up. In this case, the compiler 
traverses up the source and target trees, adding source nodes to the hierarchy match 
lists of the target nodes until level 1 of either the source or the target tree is reached at 

30 step 190. Decision step 192 then determines whether level 1 of the source tree has been 

reached. If so, the compiler pass 3 (150) ends at step 196. If level 1 of the source tree 
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has not been reached at step 192, the compiler will traverse up the source tree and add 
the source tree nodes encountered to the hierarchy match list of the target tree node at 
step 198 before pass 3 (150) ends at step 196. 

Referring now to Fig. 6A, an example of the flattening hierarchy matching 
5 methodology of Fig. 5C is illustrated with respect to an exemplary mapping 200. The 
mapping 200 is illustrated for a source tree comprising root node Root 1, record nodes 
Rl and R2, and a field node Fl, and a target tree comprising root node Root 2, record 
nodes R3 R4 and R5, and a field node F2. The user defined mapping in this example 
includes a user drawn link between field nodes Fl and F2, wherein the mapping may be 
10 created, for example, via a graphical mapping definition tool (not shown). Assuming 
the source loop path node for target node R5 is R2, the compiler climbs up the source 
tree from the source loop path node R2 adding source nodes R2 and Rl to the hierarchy 
Q match list for the current target tree node R5, as illustrated in Fig. 6A by the dashed 

compiler hnk lines 202. The resulting hierarchy 203 is illustrated in Fig. 6A, wherein 
CJS the V symbol indicates a logical for-each operator. As an example, the hierarchy 203 

In may indicate that for each Rl and for each R2 record in a source document, the code 

!i; should generate an R5 record in the target document. 

Referring also to Fig. 6B, another exemplary mapping 206 is illustrated in 
ffi which it is assumed that the source loop path node for target node R5 is R3. According 

' |0 to step 172 of the methodology of Fig. 5C, the compiler climbs up the source tree from 
Q the source loop path node R3, adding source nodes R3, R2 and Rl to the hierarchy 

match list for the target node R5. The hierarchy matching in this case results in 
compiler links 208 illustrated in dashed-line form, with a representative hierarchy 209 
illustrated for the flattening hierarchy matching of the mapping 206. 
25 Referring now to Figs. 5D and 7A, an exemplary mapping 210 illustrates an 

exemplary hierarchy matching wherein the compiler hnk option is top-down in 
accordance with the invention. In this example, it will be noted that source tree node 
Rootl is at level 0, node Rl is at level 1, node R2 is at level 2, and field node Fl is at 
level 3. Similarly, in the source tree, root node Root2 is at level 0, node R3 is at level 
30 1, node R4 is at level 2, node R5 is at level 3, and field node F2 is at level 4. 
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Assuming that source node R2 is the source loop path node for target node R5, 
the compiler in this example would determine at step 176 of Fig. 5D that the target 
node level (e.g., 3) is greater than the source loop path node level {e,g., 2). 
Accordingly, the compiler traverses up the target tree until the levels match (e.g., to 
5 target node R4 at level 2) per step 178 and adds source loop path node R2 to the 
hierarchy match hst of each target node R5 and R4. Preceding to step 1 82, the 
compiler then traverses up the source and target trees, adding source tree node Rl to the 
hierarchy match Ust of target tree node R3 as illustrated by compiler link dashed lines 
212. The resulting hierarchy 213 for this example is illustrated in Fig. 213 with the V 
10 symbol indicating a logical for-each operator. 

]n the example mapping 214 of Fig. 7B, assuming that source node R3 is the 
source loop path node for target node R5, the compiler at step 176 of Fig. 5D 
O determines that the target node level (e.g., 2) is not greater than the source loop path 

fj) node level (e.g,, 3). Accordingly, the compiler traverses up the source tree until the 

y 5 levels match (e.g., source node R2 at level 2) per step 184, adding source tree nodes R3 

111 and R2 to the hierarchy match list of target node R5 per step 1 86. The compiler then 

Q traverses up the source and target trees, adding source tree node Rl to the hierarchy 

'■^ match hst of target tree node R4. Compiler links 2 1 6 illustrate the top-down hierarchy 

y1 matching in this example, along with an exemplary hierarchy 217. 

^^0 Examples of bottom-up hierarchy matching are illustrated in Figs. 8A and 8B. 

^ Referring also to Fig. 5E, it is assumed that source node R2 in Fig. 8A is the source 

loop path node for target node R5. At step 190, the compiler traverses up the source 
and target trees, adding source nodes to the hierarchy match lists of the target nodes 
until level 1 of the source tree is reached. In this case, the compiler will add source 
25 node R2 to the hierarchy match hst of target node R5, and add source node Rl to the 
hierarchy match list of target node R4. At step 192 it is determined that level 1 of the 
source tree has been reached (e.g., at node Rl). This bottom-up hierarchy matching 
results in the compiler links 220 illustrated in dashed lines, and the exemplary hierarchy 
221 illustrated in Fig. 8 A. 
30 Another exemplary hierarchy matching is illustrated in Fig. 8B wherein a 

mapping 222 includes a source node R3 which is the source loop path node for target 
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node R5. In this case, the compiler traverses up the source and target trees, adding 
source node R3 to the hierarchy match hst of target node R5, and source node R2 to the 
hierarchy match hst of target node R4 according to step 190 of Fig. 5E. Thereafter, 
decision step 192 determines that level 1 of the source tree has not been reached. 
5 Thereafter, the compiler at step 198 traverses up the source tree and adds source tree 

node Rl to the hierarchy match hst of the target tree node R4. This example of bottom- 
up hierarchy matching results in the compiler hnks 224 illustrated in dash hnes and the 
exemplary hierarchy 225 in Fig. 8B. 

PASS 4 - CODE GENERATION 
10 Referring now to Fig. 9A, an exemplary code generation pass 4 (250) is 

illustrated in which field-attribute traversal in employed. By this method, all child field 
attributes are traversed before field elements and records are traversed. It will be 
Q appreciated that although the exemplary code generation methodology illustrated in 

^ Figs. 9A-9D generates XSL or XSLT code, many other code languages may be 

05 generated by the methods and systems of the present invention, which represent a 

1^ mapping between documents and/or schemas. Beginning at step 252, the compiler 

processes the target tree root node in a pre-execute parent function (not shown) at step 
254, thereby generating a code header. Thereafter, a code trailer is generated in a post- 
execute parent fimction at step 256. 
^|0 Referring also to Fig. 9B, pass 4 (250) is continued at step 258, At step 260, 

□ record nodes are processed in the pre-execute parent fimction wherein the compiler 

generates <xsl:for-each> code using the hierarchy match list associated therewith, 
generates <xsl:iC> (conditional) code if there is an incoming link from a conditional 
fimctoid (e.g., as illustrated and discussed fiirther with respect to Fig. 14 infra) and 
25 generates a start tag code for the record at step 260. For example, <xsl:for-each> code 
is generated for each of the hierarchy matches in a hierarchy match list associated with 
a target tree node. 

A hierarchy stack (not shown) may be employed to determine what context the 
compiler is in at any given time during the compiler method. For example, the 
30 hierarchy matches from a hierarchy match list may be pushed onto the stack at step 

260, and removed therefrom (e.g., popped) as they are used in the code generation pass 
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of the method at step 262, as discussed in greater detail hereinafter. Once step 260 is 
completed, the compiler processes record nodes in the post-execute parent function at 
step 262. Therein the compiler generates </xsl:if> (conditional) code if there is an 
incoming link from a condition fimctoid, and generates <xsl:value-of> code if the 
5 incoming link is not from a conditional link by traversing the mapping to source nodes, 

generates code for constant node values, and generates </xsl : for-each> code using the 
hierarchy match hst. Where a stack is used to determine context and to generate 
relative path names, the stack may be popped at this point. 

Thereafter at step 264, leaf record nodes are processed in an execute leaf 
10 function, wherein the compiler generates <xsl:for-each> code using the hierarchy 
match list, generates <xsl:if> (conditional) code if there is an incoming Unk from a 
condition flmctoid, and generates a start tag code for the record. Also at step 264, the 

□ compiler generates </xsl:if> (conditional) code if there is an incoming link from a 

conditional flmctoid, generates <xsl:value of> code if the incoming link is not from a 

Q5 conditional link by traversing the (e.g., mapping) to source nodes, generates code for 

; jl constant node values, and generates </xsl : for-each> code using the hierarchy match 

% Hst. 

Referring also to Figs. 9C and 9D, pass 4 (250) continues at step 266 after 
m which field nodes are processed in the execute leaf function at step 268, where the code 

^|0 generates <xsl:value-o£> code by traversing the (e.g., mapping) to source nodes and 

Q generating code for constant node values before pass 4 (250) ends at step 270. In Fig. 

9D, the generation of <xsl:value-of> code by grid (e.g., mapping) traversal begins at 
step 272. At step 274 if the link to the target node is simple {e.g,, a hne directly 
connecting a source tree node with a target tree node in a visual mapping), the compiler 
25 will use the value of the source tree node to generate the code. If a flmctoid link has 

been defined with respect to the target node, the compiler will trace back to the source 
tree through all possible paths recursively and generate <xsl: variable 
name=. . .select=. . ./> code for each flmctoid at step 274. In this regard, where functoids 
are cascaded (e.g., mapping 380 of Fig. 13), the variable code generated for earlier 
30 occurrences of the same functoid in the same context may advantageously be reused via 
a functoid stack during the code generation pass 4 (250). 
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PASS 5 - FREE MEMORY 
Referring now to Fig. 10, once the code has been generated in pass 4 (250), pass 
5 (300) begins at step 302. In pass 5 (300), the compiler frees memory for compiler 
nodes associated with target tree nodes at step 304 and frees memory for compiler 
5 variable classes and memory associated with fimctoids at step 306 before pass 5 (300) 

ends at step 308. This ends the exemplary multi-pass compiler method comprising 
passes 30, 50, 150, 250, and 300, in which XSL or XSLT code may be generated to 
represent a mapping between document schemas. Many other embodiments of the 
invention are possible, including, for example, code generation passes generating code 
1 0 in other than XSL or XSLT languages. 

Fig. 1 1 illustrates an exemplary mapping 350 of a source tree 352 and a target 
tree 354. The source tree 352 may be a source schema that is a definition of an XML 
p document. Similarly, the target tree 354 may be a target schema representing another 

^ XML document. A mapping definition tool (not shown) allows a user to graphically 

05 specify data transformations firom the source tree schema 350 to the target schema 354. 

m The methods and system of the present invention may be used, for example, to take the 

z: visual mapping representations of Fig. 1 1 and create a coded representation of the map. 

In accordance with one aspect of the invention, the coded mapping representation may 
m be an XSL style sheet. 

^|0 The exemplary mapping 350 includes multiphcation fimctoids 356 and 358, as 

U well an addition fionctoid 360. In this example, the value of TAX ID NUMBER fl-om 

the source tree 352 is multiplied with the value of LOCATION QUALIFIER, using the 
multiplication fimctoid 356. In like manner, the value of TAX ID NUMBER is 
multiplied with the value of TAX EXEMPT CODE using the multiphcation functoid 

25 358, The result of these two multiphcations is fed as an input to the addition functoid 

360. The output of the addition functoid 360 is then linked to the REF03 field node in 
the target schema tree 354. It will be recognized in this regard that the compiler 
methods and systems of the present invention maybe used to generate codes according 
to mappings, graphical and otherwise, employing a variety of fimctoids, and that the 

30 scope of the current invention is not hmited to use with the fimctoids illustrated and 
described herein. 
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Referring now to Fig. 12, another exemplary mapping 370 is illustrated between 
a target tree 372 and a source tree 374. Referring also to Figs. 4A-4H, the mapping 370 
has multiple source loop paths (e,g., source tree nodes XI and REC0RD6 via target 
tree nodes REC0RD2 and REC0RD3) for which the compiler in pass 2 (50) may 
5 generate a compiler error during selection of correct source loop path. Likewise, in the 
exemplary methodology of Fig. 1, pass 2 at step 8 includes multiple source loop path 
detection. In accordance with one aspect of the present invention, the compiler in this 
example may generate an error or warning such as the following: 

1 0 Completed Compile. 

Warning: target node REC0RD2 has multiple source loop paths. 
Warning: target node REC0RD3 has multiple source loop paths. 
Q XSL Compiler-2 warnings. 

Q5 Referring now to Fig. 13, another exemplary mapping 380 is illustrated between 

ifl a source schema tree 382 and a target schema tree 384 having cascaded functoids. 

]il Using the compiler method illustrated in Figs. 3-5, a compiler (not shown) may 

generate a coded representation of the mapping 380 for execution in a computer system 
m (not shown). As an example, such a compiler may generate the following code in XSL 

^JO for the mapping 380: 



<xsl: stylesheet xmlns :xsl='http://www.w3 .org/1 999/XSL/Transfonn' 
xmlnsrmsxsl- urn: schemas-rmcrosofl-com.xslt' xmlas : var='um: var' 
xmlns:user='iini:user' version- 1 .0> 

2 5 <xsl : output method='xnil' indent^'yes' ormt-xml-declaration='yes' /> 

xsl:template niatch=V'> 

<xsl:apply-teniplates select='Xr7> 
</xsl:template> 
<xsl:template match— Xr> 
30 <X2> 

<xsl:for-each select=='Recordr> 
<Record3> 

<xsl: variable name='var:vr 
select='user:fctmathadd2(string(@Fieldl),string(@Field2))V> 

3 5 <xsl:attribute name=Tield6'xxsl: value-of select^'Svar: vr/></xsl:attribute> 

<xsl:variable name=Var:v2' 
select=='user:fctrmthinultiply2(string(@Field3),string($var:vl))V> 

<xsl:attribute name- Field7'><xsl: value-of select='$ var: v2V></xsl: attribute> 
</Record3> 
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</xsl:for-each> 

<xsl:for-each select='Recordr> 
<Record4> 

<xsl:variable name='var:v3' 
5 select='user:fctmathadd2(string((^ieldl),string(@FieW^ 

<xsl:variable name='var:v4' 
selecHuser:fctrmthmultiply2(string(@Field3),string($var:v3))7> 

<xsl:variable name- var:v5' 
select='user:fctmathdivide(strmg($ var: v4),strmg(ancestor: : *[ 1 ]/Record2/^ 
1 0 <xsl:attribute name='Field8'><xsl:value-of select='$var:v5Vx/xsl:attribute> 

<xsl:variable name- var:v6' 
select='user:fctmathsubtract2(strmg($var: v4),string(ancestor: : * [ 1 ]/Record2/@Field4))'/> 

<xsl:attribute name=Tield9'><xsl:valiie-of select='$var:v6V></xsl:attribute> 
</Record4> 
15 </xsl:for-each> 
</X2> 

</xsl:template> 

<msxsl:script language='VBScript' inplements-prefix='user'> 
<![CDATA[ 

20 Function FctMathAdd2( p_strparmO, p strparml ) 

If ( IsNumeric( p_strpannO ) And IsNiimeric( p strparml ) ) Then 
FctMathAdd2 = CDbl( p_strparmO ) + CDbl(P_strpaiml ) 

O Else 

m FctMathAdd2 = 

lis End If 

Q End Function 

: ^ Function FctMathMultiply2( p_strparmO, p strparml ) 

If ( IsNumeric( p_stiparmO ) And IsNumeric( p_strpRMl ) ) Then 

J30 FctMathMultiply2 = CDbl( p strparmO ) * CDbl( p_strparml ) 

Else 

FctMathMultiply2 = 

0 End If 

01 End Function 
^5 

Function FctMathSubtract2( p_strparmO, p strparml ) 
i3 If ( IsNumeric( p strparmO ) And IsNumeric( p strparml ) ) Then 

FctmathSubtract2 = CDb 1 ( p_strparmO ) - CDb 1 ( p_strparml ) 
™" Else 
40 FctMathubtract2 = 

End If 
End Function 

Function FctMathDivide( p__strA, p_strB ) 
45 If) IsNumeric( pstrA ) And IsNumeric( p_strB ) ) Then 

dblDenommator CDbl ( p strB ) 
If ( dbl Denominator = 0 ) Then 
FctMathDivide = 

Else 

5 0 FctMathDivide = CDb 1 ( p_strA ) / db 1 Denominator 

End If 

Else 

FctMathDivide = 

End If 
55 EndFunction 
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]]> 

</insxsl:script> 
</xsl:stylesheet> 

5 Given the following sample source instance: 

<X1> 

<Recordl Field="10" Field2= "20" Field3= "30"/> 

<Record2 Field4="40" Field5-"50"/> 

</Xl> 

10 

Transforming using the compiler generated XSL map, the sample target 
instance output is as follows: 

15 

<X2> 

<Record3 Field6="30" Field7-"900" /> 
<Record4 Field8="18" Field9-"860" /> 

</X2> 

20 

^% Referring now to Fig. 14, another exemplary mapping 390 is illustrated between 

a target tree 394 and a source tree 392, wherein the mapping 390 may include 
Cj conditional functoids 396 and 398 as illustrated in the Figure. In this example, the 

Jii conditional functoid 396 is linked to an ADDRESSQUALIFIER node in the source tree 

as 392, and to a BILLTOADDRESS node in the target tree 394. The functoid 396 has 

}^ another input (not shown) having a value of BT, and the other conditional functoid 398 

y : has a second input (not shown) with a value of ST. This configuration may be used, for 

:y example, at run time to determine whether the address qualifier field of a particular 

J2 source document is set to BT or ST, and to provide the company name, address line 1, 

30 and city associated with the document to either the BILLTOADDRESS or the 

SHIPTOADDRESS records of the target document, respectively. As an illustration of 
the compiler methodology regarding conditional functoids, the following exemplary 
compiler output code is provided in XSL: 

3 5 Starting Compile . , . 

<xsl: stylesheet xmlns:xsl= 'http://www.w3.org/1999/XSL/Tmsform' 
xmhis:msxsl='um:schemas-microsoft-com:xslt' xmhis:var='um:var' 
xmlns:user='um:user' version='1.0'> 

<xsI:output method='xmr indent-'yes' omit-xml-declaration- yes' /> 
40 <xsl:template match=V'> 

<xsl:apply-templates select='PiirchaseOrder'/> 
</xsI:template> 

<xsl:template match='PurchaseOrder'> 
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<TargetPurchase> 

<xsl:for-each select='Address'> 

<xsl: variable name='var:vr 
select='user:FctEqual(string(@AddressQualifier),"BT")'/> 
5 <xsl:iftest='$var:vr> 

<BillToAddress> 

<! — Comection from Source Node ' AddressQualifier' to Destination 

Node 'AddressQualifier'--> 

<xsl:attribute name='AddressQualifier'><xsl:value-of 
1 0 select='@AddressQualifierV></xsl:attribute> 

<! — Connection from Source Node 'CompanyName' to 
Destination Node 'CompanyName'-> 

<xsl:attribute name='CompanyName'><xsl:value-of 
select=' @CompanyName 7></xsI: attribute> 
1 5 <! — Connection from Source Node ' AddresLineT to 

Destination Node 'AddressLinel'-^ 

<xsl:attribute name='AddressLiner><xsl:value-of 
select=' @AddressLine 1 V></xsl:attribute> 

<! — Connection from Source Node 'City' to 

20 Destination Node 'City'^ 

<xsl : attribute name= ' City '><xsl : value-of 

select='@City'/></xsl:attribute> 

0 </BillToAddress> 
=a </xsl:if> 

fU5 </xsl:fbr-each> 

p <xsl:fr)r-each select='Address'> 

%\ <xsl:variable name='var:v2' 

1 n select-'user:FctEqual(string(@AddressQualifier), "ST")'/> 
^ <xsl:iftest='$var:v2'> 

;30 <ShipToAddress> 

'""^ <1 — Connection from Source Node 'AddressQualifier' 

;L. to Destination Node 'AddressQualifier'^ 

y <xsl: attribute name='AddressQualifier'><xsl:value-of 

y select='@AddressQualifierV></xsl:attribute> 

111 5 <! — Connection from Source Node ' CompanyName' to 

Destination Node 'CompanyName' ^ 
rj <xsl:attribute name='CompanyName'><xsl: value-of 

Q select='@CompanyNameV></xsl:attribute> 

<! — Connection from Source Node ' AddressLine 1 ' to 
40 Destination Node 'AddressLine 1'^ 

<xsl"attribute name- AddressLine 1 '><xsl: value-of 
select='@AddressLine 1 '/></xsl : attribute> 

<! — Connection from Source Node 'City' to 

Destination Node 'City'-> 
45 <xsl:attribute name='City'><xsl:value-of 

select+'@CityV></xsl:attribute> 

</ ShipTo Address> 

</xsl:if> 
</xsl:for-each> 
50 </TotalPurchase> 
</xsl:template> 

<msxsl:script language='VBScript' implements-prefix='user'> 
<![CDATA[ 

FunctionFctEqual( p_strA, p strB ) 
55 If Lcase( p_strA ) = "true" Then 

p_strA = true 
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Elself Lcase( p_strA ) - "false" Then 

p_strA = false 
End If 

If LCase( p_strB ) - "true" Then 
5 pstrB = true 

Elself Lcase(p_strB ) = "false" Then 

p strB ^ false 
End If 

If ( IsNumeric( p_strA ) And IsNumeric( p_strB ) ) Then 
1 0 p_strA - CDbl( p strA ) 

p_strB - CDbl( p_strB ) 
End If 

If (p_strA = p_strB ) Then 
FctEqual = true 
15 Else 

FctEqual = False 
End If 
Enc Function 

20 ]]> 

</msxsl:script> 
</xsl;stylesheet> 



§5 Given the following sample source instance: 



y1 <PurchaseOrder> 

ffl <Address AddressQualifier="AddressQualifier_r' CompanyName="CompanyNanie_l " 

Q AddressLinel="AddressLinel_l " City="Cit}^l ">ST</Address> 

3 0 <Address AddressQualifier=" AddressQualifier_2 ' ' CompanyName=" CompanyName_2 " 

AddressLine 1 =" Addressline 1_2" City=City_2">BT</Address> 

</PurchaseOrder> 



Transforming using the compiler generated XSL map, the sample target 
instance output is as follows: 



<TotalPurchase> 

<ShipToAddress AddressQualifier=" AddressQualifier_r' CompanyName="CompanyName_r' 
40 AddressLinel="AddressLmel_r' City="City_l" /> 

<BillTo Address AddressQualifier="AddressQualifer_2" ConpanyName="CompanyName_2 " 

AddressLinel="AddressLinel_2" City="City_2" /> 

</TotaIPurchase> 

45 

EXEMPLARY OPERATING ENVIRONMENT 
In order to provide a context for the various aspects of the invention, FIG. 15 
and the following discussion are intended to provide a brief, general description of a 
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suitable computing environment in which the various aspects of the present invention 
may be implemented. While the invention has been described above in the general 
context of computer-executable instructions of a computer program that runs on a 
computer, those skilled in the art will recognize that the invention also may be 
5 implemented in combination with other program modules. Generally, program 

modules include routines, programs, components, data structures, etc. that perform 
particular tasks or implement particular abstract data types. Moreover, those skilled in 
the art will appreciate that the inventive methods may be practiced with other computer 
system configurations, including single-processor or multiprocessor computer systems, 
10 minicomputers, mainframe computers, as well as personal computers, hand-held 

computing devices, microprocessor-based or programmable consumer electronics, and 
the Uke. The illustrated aspects of the invention may also be practiced in distributed 
p computing environments where tasks are performed by remote processing devices that 

.2 are linked through a communications network. However, some, if not all aspects of the 

W5 invention can be practiced on stand-alone computers. In a distributed computing 

IJ1 environment, program modules may be located in both local and remote memory 

storage devices. 

With reference to FIG. 15, an exemplary system for implementing the various 
m aspects of the invention includes a conventional server computer 520, including a 

^|0 processing unit 521, a system memory 522, and a system bus 523 that couples various 
O system components including the system memory to the processing unit 521 . The 

processing unit may be any of various commercially available processors, including but 

not limited to Intel x86, Pentium and compatible microprocessors from Intel and others, 

including Cyrix, AMD and Nexgen; Alpha from Digital; MIPS from MIPS 
25 Technology, NEC, IDT, Siemens, and others; and the PowerPC from IBM and 

Motorola. Dual microprocessors and other multi-processor architectures also can be 

used as the processing unit 521. 

The system bus may be any of several types of bus structure including a 

memory bus or memory controller, a peripheral bus, and a local bus using any of a 
30 variety of conventional bus architectures such as PCI, VESA, MicroChannel, ISA and 

EISA, to name a few. The system memory includes read only memory (ROM) 524 and 
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random access memory (RAM) 525. A basic input/output system (BIOS), containing 
the basic routines that help to transfer information between elements within the server 
computer 520, such as during start-up, is stored in ROM 524. 

The server computer 520 fiarther includes a hard disk drive 527, a magnetic disk 
5 drive 528, e.g., to read from or write to a removable disk 529, and an optical disk drive 
530, e.g., for reading a CD-ROM disk 531 or to read from or write to other optical 
media. The hard disk drive 527, magnetic disk drive 528, and optical disk drive 530 
are connected to the system bus 523 by a hard disk drive interface 532, a magnetic disk 
drive interface 533, and an optical drive interface 534, respectively. The drives and 
10 their associated computer-readable media provide nonvolatile storage of data, data 
structures, computer-executable instructions, etc. for the server computer 520. 
Although the description of computer-readable media above refers to a hard disk, a 
3 removable magnetic disk and a CD, it should be appreciated by those skilled in the art 

S that other types of media which are readable by a computer, such as magnetic cassettes, 

35 flash memory cards, digital video disks, Bernoulli cartridges, and the like, may also be 

n used in the exemplary operating environment, and further that any such media may 

fj contain computer-executable instructions for performing the methods of the present 

invention. 

1^ A number of program modules may be stored in the drives and RAM 525, 

Mo including an operating system 535, one or more application programs 536, other 

3 program modules 537, and program data 538. The operating system 535 in the 

illustrated computer may be the Microsoft Windows 2000 Server operating system, 
together with the before mentioned Microsoft Transaction Server. 

A user may enter commands and information into the server computer 520 
25 through a keyboard 540 and a pointing device, such as a mouse 542. Other input 

devices (not shown) may include a microphone, a joystick, a game pad, a satellite dish, 
a scanner, or the like. These and other input devices are often connected to the 
processing unit 521 through a serial port interface 546 that is coupled to the system bus, 
but may be connected by other interfaces, such as a parallel port, a game port or a 
30 universal serial bus (USB). A monitor 547 or other type of display device is also 

connected to the system bus 523 via an interface, such as a video adapter 548. In 
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addition to the monitor, computers typically include other peripheral output devices 

(not shown), such as speakers and printers. 

The server computer 520 may operate in a networked environment using logical 

connections to one or more remote computers, such as a remote cUent computer 549. 
5 The remote computer 549 may be a workstation, a server computer, a router, a peer 

device or other common network node, and typically includes many or all of the 

elements described relative to the server computer 520, although only a memory 

storage device 550 is illustrated in FIG. 15. The logical connections depicted in FIG. 

15 include a local area network (LAN) 551 and a wide area network (WAN) 552. Such 
10 networking environments are conrmonplace in offices, enterprise-wide computer 

networks, intranets and the Internet. 

When used in a LAN networking environment, the server computer 520 is 
p connected to the local network 551 through a netwo± interface or adapter 553. When 

;5 used in a WAN networking environment, the server computer 520 typically includes a 

i3S5 modem 554, or is connected to a conmiunications server on the LAN, or has other 
Ijl means for estabhshing communications over the wide area network 552, such as the 

%l Internet. The modem 554, which may be internal or external, is connected to the 

system bus 523 via the serial port interface 546, In a networked environment, program 
ffl modules depicted relative to the server computer 520, or portions thereof, may be 

=|0 stored in the remote memory storage device. It will be appreciated that the network 
Q connections shown are exemplary and other means of estabhshing a communications 

link between the computers may be used. 

In accordance with the practices of persons skilled in the art of computer programming, 
the present invention has been described with reference to acts and symbolic 

25 representations of operations that are performed by a computer, such as the server 

computer 520, unless otherwise indicated. Such acts and operations are sometimes 
referred to as being computer-executed. It will be appreciated that the acts and 
symbohcally represented operations include the manipulation by the processing unit 
521 of electrical signals representing data bits which causes a resulting transformation 

30 or reduction of the electrical signal representation, and the maintenance of data bits at 
memory locations in the memory system (including the system memory 522, hard drive 
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527, floppy disks 529, and CD-ROM 531) to thereby reconfigure or otherwise alter the 
computer system's operation, as well as other processing of signals. The memory 
locations where such data bits are maintained are physical locations that have particular 
electrical, magnetic, or optical properties corresponding to the data bits. 
5 Although the invention has been shown and described with respect to a certain 

embodiments, it will be appreciated that equivalent alterations and modifications will 
occur to others skilled in the art upon the reading and understanding of this 
specification and the annexed drawings. In particular regard to the various functions 
performed by the above described components (assemblies, devices, circuits, systems, 
10 etc.), the terms (including a reference to a "means") used to describe such components 
are intended to correspond, unless otherwise indicated, to any component which 
performs the specified function of the described component (ie., that is fimctionally 

rj equivalent), even though not structurally equivalent to the disclosed structure, which 

performs the function in the herein illustrated exemplary embodiments of the invention. 

05 In this regard, it will also be recognized that the invention includes a computer-readable 
medium having computer-executable instructions for performing the steps of the 
various methods of the invention. 

^= In addition, while a particular feature of the invention may have been disclosed 

|S with respect to only one of several embodiments, such feature may be combined with 

one or more other features of the other embodiments as may be desired and 

Q advantageous for any given or particular appHcation. Furthermore, to the extent that 

the terms "includes", "including", "has", "having", and variants thereof are used in 
either the detailed description or the claims, these terms are intended to be inclusive in 
a manner similar to the term "comprising" and its variants. 
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CLAIMS 



1 . A method of generating code representing a mapping between a source 
schema and a target schema, the mapping comprising data transformations from the 
source schema to the target schema, the source schema comprising a source tree 
having a source node and the target schema comprising a target tree having a target 
node, the method comprising: 

determining source node dependencies for the target node by tracing from the 
target node through the mapping to the source schema; 

matching hierarchy by generating a hierarchy match list for the target node; 

and 

generating code according to the hierarchy match list. 

2. The method of claim 1, ftirther comprising: 
initializing node dependencies memory comprising: 

allocating memory for a compiler node; 
associating the compiler node with the target node; 
allocating memory for compiler variable classes; and 
associating compiler variable classes with functoids. 

3. The method of claim 2, wherein determining source node 
dependencies comprises: 

creating a target dependencies hst for the target node; 

creating a source dependencies list for the target node; 

tracing back to the source tree through all possible paths recursively, adding 
source tree nodes to corresponding source dependencies lists; 

setting a flag for the compiler nodes associated with the target node and an 
associated parent compiler node if there is a path from the target tree node to the 
source tree; 
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adding the target tree node to the target dependency hst of an associated 
parent record node if the target node is a field node; 

consohdating the target dependency hst for target record nodes; 

consohdating the source dependency Ust for target record nodes; 

selecting a source loop path node for the target node according to the 
consolidated source dependency list; and 

storing the source loop path node in the compiler node associated with the 
target node. 

4. The method of claim 3, wherein consolidating the target dependency 
list for target record nodes comprises adding child record nodes to the target 
dependency hst if the target record node has a child field node with a flag set and the 
target record node has a child record node with a flag set, and wherein consohdating 
the source dependency list for target record nodes comprises adding source 
dependencies of each node in the target dependency hst to the target dependency list. 

5. The method of claim 4, wherein selecting a source loop path node for 
the target node according to the consohdated source dependency list comprises: 

iterating through source nodes in the source dependency list; 

finding the highest level loop point for which more than one occurrence of the 
source node is possible; 

storing the current node in the source dependency list as the source loop path 
node if no loop point is found; 

storing the current node in the source dependency hst as the source loop path 
node if the current loop point is at a higher level than the previous loop point and if 
the paths to the previous and current loop points converge; 

storing the previous node in the source dependency list as the source loop path 
node if the current loop point is at a lower level than the previous loop point and if the 
paths to the previous and current loop points converge; 
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storing the current node in the source dependency list as the source loop path 
node if the current loop point is at the same level as the previous loop point, and if the 
paths to the previous and current loop points converge; 

generating a compiler error if the paths to the previous and current loop points 
do not converge. 

6. The method of claim 5, wherein finding the highest level loop point 
for which more than one occurrence of the source node is possible comprises finding 
the highest level loop point for which maxoccurs^*. 

7. The method of claim 5, wherein storing the source loop path node in 
the compiler node associated with the target node comprises: 

storing the soiu'ce loop path node as the source loop path node in the compiler 
node if the source loop path node is a record node; and 

storing the parent record node of the source loop path node as the source loop 
path node in the compiler node if the source loop path node is a field node. 

8. The method of claim 7, wherein matching hierarchy by generating a 
hierarchy match list for the target node comprises: 

creating a hierarchy match Ust for the target node if the target node has a 
source loop path node; 

determining a consolidated compiler link option using links on target node 
dependencies; and 

matching hierarchy according to the consolidated compiler link option. 

9. The method of claim 8, wherein matching hierarchy according to the 
consolidated compiler link option comprises: 

climbing up the source tree fi-om the source loop path node, adding each 
source node to the hierarchy match list for the target node if the compiler link option 
is flattening; 
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traversing up the source tree until the source tree node level matches the target 
node level, adding source tree nodes to the hierarchy match list if the target node 
level is greater than the source loop path node level and if the compiler link option is 
top-down; 

traversing up the target tree until the source tree node level matches the target 
node level, adding the source loop path node to the hierarchy match list of each target 
node traversed if the source loop path node level is greater than the target node level 
and if the compiler hnk option is top-down; 

traversing up the source and target trees, adding source tree nodes to the 
hierarchy match Hsts of target nodes level by level if the compiler hnk option is top- 
down; 

traversing up the source and target trees, adding source nodes to the hierarchy 
match lists of target nodes until level 1 of either the source or the target tree is 
reached if the compiler link option is bottom-up; 

traversing up the source tree, adding the source tree node to the hierarchy 
match hsts of the target tree nodes if the compiler link option is bottom-up and if 
level 1 of the target tree is reached; and 

traversing up the source tree, adding the source tree nodes to the hierarchy 
match list of the target node. 

10. The method of claim 9, wherein generating code according to the 
hierarchy match Ust comprises: 

generating a code header for a root node; 
generating a code trailer for the root node; 
processing target record nodes in a preexecuteparent function, a 
postexecuteparent function, and an executeleaf function; and 
processing field nodes in the executeleaf function. 

1 1 . The method of claim 10, wherein processing target record nodes in a 
preexecuteparent function, a postexecuteparent function, and an executeleaf function 
comprises: 
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generating <xsl : for-each> code in the preexecuteparent function using the 
hierarchy match Ust; 

generating <xsl : if> code in the preexecuteparent function if there is an 
incoming link from a conditional functoid; 

generating a start tag code for the target record node in the preexecuteparent 
function; 

generating </xsl : if> code in the postexecuteparent function for record nodes 
if there is an incoming link from a conditional functoid; 

generating <xsl : value-ofi> code in the postexecuteparent function for record 
nodes if the incoming link is not from a conditional hnk by traversing a grid to source 
nodes, and generating code for constant node values; 

generating </xsl : for-each> code in the postexecuteparent function for record 
nodes using the hierarchy match list; 

generating <xsl : for-each> code in the executeleaf function for leaf record 
nodes using the hierarchy match list; 

generating <xsl : i£> code in the executeleaf function for leaf record nodes if 
there is an incoming link from a conditional functoid; 

generating a start tag code in the executeleaf function for leaf records; 

generating </xsl : if> code in the executeleaf function for leaf record nodes if 
there is an incoming link from a conditional functoid; 

generating <xsl : value-of> code in the executeleaf function for leaf record 
nodes if the incoming link is not from a conditional link by traversing the grid to 
source nodes, and generating code for constant node values; and 

generating <xsl : value-of> code in the executeleaf function for field nodes by 
traversing grid to source nodes, and generating code for constant node values. 

12, The method of claim 11, wherein generating <xsl : value-of> code 
comprises: 

using the value of a source tree node to generate code if the link is simple; and 
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tracing back to the source tree through all possible paths recursively and 
generating <xsl : variable name=^.,.select=..,/> code for each functoid if there is a 
flmctoid hnk. 

1 3 . The method of claim 1 2, wherein determining source node 
dependencies for the target node by tracing from the target node through the mapping 
to the source schema comprises depth-first tree traversal of the target tree, matching 
hierarchy by generating a hierarchy match list for the target node comprises depth- 
first tree traversal of the target tree, and wherein generating code according to the 
hierarchy match list comprises field-attribute tree traversal of the target tree. 

14. The method of claim 10, wherein processing target record nodes in a 
preexecuteparent function, a postexecuteparent function, and an executeleaf function 
comprises: 

generating looping start code in the preexecuteparent function using the 
hierarchy match list; 

generating conditional start code in the preexecuteparent fvmction if there is an 
incoming link from a conditional functoid; 

generating a start tag code for the target record node in the preexecuteparent 
function; 

generating conditional end code in the postexecuteparent fimction for record 
nodes if there is an incoming link from a conditional functoid; 

generating value code in the postexecuteparent function for record nodes if the 
incoming link is not from a conditional link by traversing a grid to source nodes, and 
generating code for constant node values; 

generating looping end code in the postexecuteparent function for record 
nodes using the hierarchy match list; 

generating looping start code in the executeleaf function for leaf record nodes 
using the hierarchy match list; 

generating conditional start code in the executeleaf function for leaf record 
nodes if there is an incoming link from a conditional functoid; 
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generating a start tag code in the executeleaf function for leaf records; 

generating conditional end code in the executeleaf function for leaf record 
nodes if there is an incoming link from a conditional functoid; 

generating value code in the executeleaf function for leaf record nodes if the 
incoming link is not from a conditional link by traversing the grid to source nodes, 
and generating code for constant node values; and 

generating value code in the executeleaf function for field nodes by traversing 
grid to source nodes, and generating code for constant node values. 

1 5 . The method of claim 2, wherein matching hierarchy by generating a 
hierarchy match Ust for the target node comprises: 

creating a hierarchy match hst for the target node if the target node has a 
source loop path node; 

determining a consohdated compiler link option using links on target node 
dependencies; and 

matching hierarchy according to the consolidated compiler link option. 

16. The method of claim 2, wherein generating code according to the 
hierarchy match list comprises: 

generating a code header for a root node; 
generating a code trailer for the root node; 
processing target record nodes in a preexecuteparent function, a 
postexecuteparent function, and an executeleaf function; and 
processing field nodes in the executeleaf function. 

17. The method of claim 1 6, wherein processing target record nodes in a 
preexecuteparent function, a postexecuteparent function, and an executeleaf function 
comprises: 

generating <xsl : for-each> code in the preexecuteparent function using the 
hierarchy match list; 
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generating <xsl : if> code in the preexecuteparent function if there is an 
incoming hnk from a conditional functoid; 

generating a start tag code for the target record node in the preexecuteparent 
function; 

generating </xsl : if> code in the postexecuteparent function for record nodes 
if there is an incoming Hnk from a conditional functoid; 

generating <xsl value-of> code in the postexecuteparent function for record 
nodes if the incoming link is not from a conditional link by traversing a grid to source 
nodes, and generating code for constant node values; 

generating </xsl : for-each> code in the postexecuteparent function for record 
nodes using the hierarchy match hst; 

generating <xsl : for-each> code in the executeleaf function for leaf record 
nodes using the hierarchy match list; 

generating <xsl : ifi> code in the executeleaf function for leaf record nodes if 
there is an incoming Hnk from a conditional functoid; 

generating a start tag code in the executeleaf function for leaf records; 

generating </xsl : it> code in the executeleaf function for leaf record nodes if 
there is an incoming Hnk from a conditional functoid; 

generating <xsl : value-o£> code in the executeleaf function for leaf record 
nodes if the incoming link is not from a conditional link by traversing the grid to 
source nodes, and generating code for constant node values; and 

generating <xsl : value-of> code in the executeleaf function for field nodes by 
traversing grid to source nodes, and generating code for constant node values. 

18. The method of claim 1 , wherein generating code according to the 
hierarchy match Hst comprises creating an XSL style sheet representation of the 
mapping. 

19. The method of claim 16, wherein processing target record nodes in a 
preexecuteparent function, a postexecuteparent function, and an executeleaf function 
comprises: 
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generating looping start code in the preexecuteparent function using the 
hierarchy match Ust; 

generating conditional start code in the preexecuteparent function if there is an 
incoming link from a conditional functoid; 

generating a start tag code for the target record node in the preexecuteparent 
function; 

generating conditional end code in the postexecuteparent function for record 
nodes if there is an incoming link from a conditional functoid; 

generating value code in the postexecuteparent function for record nodes if the 
incoming link is not from a conditional link by traversing a grid to source nodes, and 
generating code for constant node values; 

generating looping end code in the postexecuteparent function for record 
nodes using the hierarchy match list; 

generating looping start code in the executeleaf function for leaf record nodes 
using the hierarchy match list; 

generating conditional start code in the executeleaf function for leaf record 
nodes if there is an incoming link from a conditional functoid; 

generating a start tag code in the executeleaf function for leaf records; 

generating conditional end code in the executeleaf function for leaf record 
nodes if there is an incoming link from a conditional functoid; 

generating value code in the executeleaf function for leaf record nodes if the 
incoming link is not from a conditional link by traversing the grid to source nodes, 
and generating code for constant node values; and 

generating value code in the executeleaf function for field nodes by traversing 
grid to source nodes, and generating code for constant node values. 

20. A method for compiling a mapping between a source schema having 
source nodes associated therewith, and a target schema having target nodes associated 
therewith, comprising: 

determining source node dependencies for at least one target node by tracing 
from the at least one target node through the mapping to the source schema; 
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matching hierarchy by generating a hierarchy match hst for the at least one 
target node; and 

generating code according to the hierarchy match list. 

2 1 . The method of claim 20, further comprising: 
initializing node dependencies memory prior to determining source 

dependencies; and 

freeing node dependencies memory after generating code. 

22. The method of claim 20, wherein determining source node 
dependencies comprises picking a source loop path for each target node, and 
detecting multiple source loop paths. 

23. The method of claim 20, wherein determining source node 
dependencies comprises generating a source dependency Ust. 

24. The method of claim 22, further comprising generating a compiler 
error if multiple source loop paths are detected. 

25. The method of claim 20, wherein matching hierarchy comprises 
generating a hierarchy match list. 

26. The method of claim 20, wherein matching hierarchy comprises one of 
flattening, top-down, and bottom-up matching. 

27. The method of claim 20, wherein the source schema and the target 
schema are XML schemas, 

28. The method of claim 20, wherein generating code comprises creating 
an XSL style sheet representation of the mapping. 
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29. A system for generating code representing a mapping between a 
source schema and a target schema, the mapping comprising data transformations 
from the source schema to the target schema, the source schema comprising a source 
tree having source nodes and the target schema comprising a target tree having target 
nodes, the system comprising: 

means for determining source node dependencies for the target node by 
tracing from the target node through the mapping to the source schema; 

means for matching hierarchy by generating a hierarchy match hst for the 
target node; and 

means for generating code according to the hierarchy match hst, 

30. A computer-readable medium having computer-executable 
instructions for: 

generating code representing a mapping between a source schema and a target 
schema, the mapping comprising data transformations from the source schema to the 
target schema, the source schema comprising a source tree having a source node and 
the target schema comprising a target tree having a target node; 

determining source node dependencies for the target node by tracing from the 
target node through the mapping to the source schema; 

matching hierarchy by generating a hierarchy match list for the target node; 

and 

generating code according to the hierarchy match list. 
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ABSTRACT OF THE INVENTION 

A method for compiling a mapping between a source schema and a target 
schema is provided. The method comprises multiple passes, including determining 
source node dependencies, matching hierarchy, and generating code. The method 
may further comprise initializing node dependency memory prior to determining 
source node dependencies, and freeing node dependency memory after code 
generation. The compiler algorithm provides for compiling user-defined functions in 
the mapping into the compiled representation. Another aspect of the invention 
provides for generating an XSL code representation of the mapping. An XSL style 
sheet representation may thus be generated from a visual mapping in accordance with 
the invention. 
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CONTINUE PASS 4 



250 



I 



PROCESS RECORD NODES 
IN PREEXECUTEPARENTFUNCTION: 
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INCOMING LINK FROM A CONDITIONAL FUNCTOID 
(3) GENERATE START TAG CODE FOR THE RECORD 
(4) GENERATE </xsl : if> (CONDITIONAL) CODE IF THERE IS AN 
INCOMING LINK FROM A CONDITIONAL FUNCTOID 
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As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name, I believe I am the original, first 
and sole inventor (if only one name is Hsted below) or an original, first and joint inventor (if plural names are listed 
below) of the subject matter which is claimed and for which a patent is sought on the invention 
entitled: METHOD OF COMPILING SCHEMA MAPPING 

the specification of which: 

(a) X is attached hereto. 

(b) was filed on as Serial No. 0 / or 

Express Mail No. , as Serial No. not yet known, and was amended on 

(if applicable). 

(c) was described and claimed in PCT International Application No. filed 

on and amended under PCT Article 19 on (if any). 

I hereby state that I have reviewed and understand the contents of the above identified specification, including the claims, 
as amended by any amendment referred to above. 

I acknowledge the duty to disclose information which is material to patentability in accordance with Title 37, Code of 
Federal Regulations 'L56(a). 

PRIORITY CLAIM 

I hereby claim foreign priority benefits under Title 35, United States Code, '119 of any foreign application(s) for patent or 
inventor's certificate or of any PCT international application(s) designating at least one country other than the United 
States of America listed below and have also identified below any foreign application(s) for patent or inventor's 
certificate or any PCT international application(s) designating at least one country other than the United States of 
America filed by me on the same subject matter having a filing date before that of the application(s) of which priority is 
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(e) such applications have been filed as follows. 

EARLIEST FOREIGN APFLICATION(S), IF ANY FILED WITHIN 12 MONTHS 
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As a named inventor, I hereby appoint the following attomey(s) and/or agent(s) to prosecute this application and transact 
all business in the Patent and Trademark Office connected therewith. (List name and registration number) 

Himanshu S. Amin, Reg. No. 40,894; Gregory Turocy, Reg. No. 36, 952; 
Christopher P. Harris, Reg. No. 43,660; Eric M. Highman, 
Reg. No. 43,672; and Gary J. Pitzer, Reg. No. 39,334. 

Katie E. Sako, Reg. No. 32,628 and Daniel D. Grouse, Reg. No. 32,022. 
The undersigned to this declaration and power of attorney hereby authorizes the U.S. attomey(s) named herein to accept 
and follow instructions from: 

Name(s) of authorized representative(s) 

Address 



as to any actions to be taken in the Patent and Trademark Office regarding this application without direct communication 
between the U.S. attomey(s) and the undersigned. In the event of a change in the person(s) from whom instructions may 
be taken, the U.S. attomey(s) will be so notified by the undersigned. 
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(name and telephone number) 

Himanshu S. Amin 

Amin, Eschweiler & TUROCY, LLP Himanshu S. Amin 

24™ Floor, National City Center 

1900 East 9™ Street (216) 696-8730 

Cleveland, Ohio 44114 
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information and belief are believed to be true; and fiirther that these statements were made with knowledge that willfril 
false statements and the like are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code, and that such willfiii false statements may jeopardize the validity of the application or any patent 
issued therein. 

Full name of sole or first inventor, if any: Shekhar N. Swamv 

Inventor's signature: 



Date: Country of Citizenship: India 



Residence: Beilevue. Washington 



Post Office Address: 5037 148*^ Avenue NE, Unit H206 

Bellevue. Washington 98007 
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Residence: Kirkland, Washington 
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